Modeling pronunciation variation for a dutch CSR: testing three methods
نویسندگان
چکیده
This paper describes how the performance of a continuous speech recognizer for Dutch has been improved by modeling pronunciation variation. We used three methods to model pronunciation variation. First, within-word variation was dealt with. Phonological rules were applied to the words in the lexicon, thus automatically generating pronunciation variants. Secondly, cross-word pronunciation variation was modeled using two different approaches. The first approach was to model cross-word processes by adding the variants as separate words to the lexicon and in the second approach this was done by using multi-words. For each of the methods, recognition experiments were carried out. A significant improvement was found for modeling within-word variation. Furthermore, modeling crossword processes using multi-words leads to significantly better results than modeling them using separate words in the lexicon.
منابع مشابه
Improving the Performance of a Dutch Csr by Modeling Pronunciation Variation
This paper describes how the performance of a continuous speech recognizer for Dutch has been improved by modeling pronunciation variation. We used three methods in order to model pronunciation variation. First, withinword variation was dealt with. Phonological rules were applied to the words in the lexicon, thus automatically generating pronunciation variants. Secondly, cross-word pronunciatio...
متن کاملImproving the performance of a Dutch CSR by modeling within-word and cross-word pronunciation variation
This article describes how the performance of a Dutch continuous speech recognizer was improved by modeling pronunciation variation. We propose a general procedure for modeling pronunciation variation. In short, it consists of adding pronunciation variants to the lexicon, retraining phone models and using language models to which the pronunciation variants have been added. First, within-word pr...
متن کاملModeling Within-word and Cross-word Pronunciation Variation to Improve the Performance of a Dutch Csr
This paper describes how the performance of a continuous speech recognizer for Dutch has been improved by modeling within-word and cross-word pronunciation variation. Within-word variants were automatically generated by applying five phonological rules to the words in the lexicon. For the within-word method, a significant improvement is found compared to the baseline. Cross-word pronunciation v...
متن کاملMaking a difference On automatic transcription and modeling of Dutch pronunciation variation for automatic speech recognition
The first goal of this study is to investigate the effect of several properties of acontinuous speech recognizer (CSR) on automatic phonetic transcription. Our resultsshow that changing certain properties of the CSR affects the resulting automatictranscriptions. The quality of the automatic transcriptions can be improved by using‘short’ HMMs and by reducing the amount of contami...
متن کاملPronunciation variation in ASR: which variation to model?
This paper describes how the performance of a continuous speech recognizer for Dutch has been improved by modeling within-word and cross-word pronunciation variation. A relative improvement of 8.8% in WER was found compared to baseline system performance. However, as WERs do not reveal the full effect of modeling pronunciation variation, we performed a detailed analysis of the differences in re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1998